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REMARKS 

Status of The Claims 

Claims 46-63, 66-72, 74-78, 80-83, 87-89, 92-95, 98, and 111-122 are pending. Claims 
59, 62, 63, 111,1 14, and 115 have been allowed. Claims 46-58, 60, 61, 66-72, 74-78, 80-83, 87- 
89, 92-95,98, 112, 113, and 116-122 have been rejected. 

Claims 53, 80 and 90 were withdrawn from consideration. The Examiner reconsidered 
the examination of claims 53, 80, and 90, and they are now under examination. 

Claims 71, 89, 93 have been amended. 

Claims 71(e) and 93 were amended to recite a polypeptide having "up to 147" amino 
acids. Support for this amendment can be found, for example, in Figure A-2. See, specifically, 
R6X. 

Claim 89 was amended to change dependency. 

No new matter has been added by way of these amendments. 

The Objections to the Specification Should Be Withdrawn 

Page 57 of the specification was objected to for requiring a sequence identifier. The 
specification has been amended and each primer sequence now has a sequence identifier. A 
revised sequence listing containing these primers is filed concurrently herewith. The objection 
has been obviated. 

Page 64 of the specification was objected to for having an incomplete citation for 
Rosenow et al. The specification has been amended to remove reference to this paper, and the 
objection has been obviated. 

Page 14, line 15 of the specification recited an amino acid substitution as amino acid 
"2999." The Examiner is correct that this is a typographical error. The specification has been 
amended to recite "299," and the objection has been obviated. 

The Examiner further requested trademarks appearing in the specification be noted. 

Pages 57 and 58 have been amended to add the trademark designations. 
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The Rejection of the Claims Under §112. Second Paragraph. Should Be Withdrawn 

Claims 46-58, 60, 66-72, 74-78, 80-83, 87-89, 92-95, 112, and 116-122 were rejected 
under 35 U.S.C. §112, second paragraph, for indefiniteness. This rejection is respectfully 
traversed. 

I. Claims 46-58, 69-72, 74-78, 80-83, 87-89, 92-94, and 116-122 were rejected for 
indefiniteness for the recitation of "an amino acid sequence". The Examiner questions if the 
Applicant intended to claim the entire amino acid sequence or a portion of the sequence. This 
rejection is respectfully traversed. 

The claims at issue recite a polypeptide "comprising an amino acid sequence as set forth 
in SEQ ID NO:X." This open language is routinely used to claim polypeptides and clearly 
recites that the polypeptides embraced by the claims will have the amino acid sequence as shown 
in the recited sequence identifier. Fragments of the recited polypeptides are not encompassed. 
The claims are clear and the Examiner is respectfully requested to withdraw the rejection. 

II. Claims 71 and 72 were rejected for indefiniteness for the recitation of "capable." The 
Examiner asserts that "an element 'capable' of performing a function is not a positive limitation 
but only requires the ability to perform." The rejection is respectfully traversed. 

Claims 71 and 72 recite, in part, a polypeptide having a recited amino acid sequence 
"wherein said polypeptide interacts with an antibody, said antibody is capable of interacting with 
a full-length CbpA polypeptide." The Examiner’s attention is drawn to MPEP 2173.05(g) that 
discusses the acceptability of functional language. The Examiner is respectfully requested to 
consider In re Barr, 170 USPQ 33 (CCPA 1971) which held that the limitation used to define a 
radical on a chemical compound as "incapable of forming a dye with said oxidizing developing 
agent" although functional, was perfectly acceptable because it set definite boundaries on the 
patent protection sought. 

In the instant case, the claims at issue employ language that further characterizes the 
function of an antibody by reciting that the antibody is "capable of interacting with a full-length 
CbpA polypeptide." To determine the acceptability of claim language, one must determine if 
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one of skill in the art would understand what is claimed, in view of the specification. The 
functional language appearing in claims 71 and 72 is clear, and the Examiner is respectfully 
requested to withdraw the rejection. 

III. Claims 51, 52, 60, 66, and 112 were rejected under 35 U.S.C. §112, second paragraph, 
for indefiniteness. This rejection is respectfully traversed. 

The claims were rejected for the term "up to 475 amino acids." Specifically, the 
Examiner questions if the claim is drawn to a polypeptide that is 0 up to 475 amino acids or 109 
amino acids (the length of SEQ ID NO: 5) up to 475 amino acids. The confusion appears to stem 
from the Examiners assertion that the term "an amino acid sequence" as recited in claim 46 can 
be shorter than the sequence appearing in SEQ ID NO:5. This interpretation of the claim 
language is incorrect. Claims 46 and 5 1 are reproduced below. 

46. An isolated polypeptide comprising an amino acid sequence as set forth in 

SEQ ID NO:5, wherein said polypeptide does not bind to choline. 

5 1 . The isolated polypeptide of claim 46, wherein said polypeptide comprises an 

amino acid sequence having up to 475 amino acids. 

Claim 46 does not encompass fragments. The claim language "comprising an amino acid 
sequence as set forth in SEQ ID NO:5" requires that the polypeptide have the complete amino 
acid sequence as set forth in SEQ LD NO: 5. Additional amino acid sequence^ can be added at 
either temunal end . Claim 5 1 merely limits the maximum number of amino acids that are added 
to the terminal ends of the polypeptide of claim 46. An identical line of reasoning applies for 
claims 52, 60, 66 and 112. Accordingly, the polypeptide recited in claims 51, 52, 60, 66, and 
1 12 is clear, and the Examiner is respectfully requested to withdraw the rejection. 

Claims 57, 58, 67, 68, 74-78, 80, 81-83, 87-89, 92-95, 116-122 and claims 70 
parts (c), (d), (e), 71 parts (a), (b), (c), (d), (e), and 72 parts (c), (d), (e) were also rejected 
under 35 U.S.C. §112, second paragraph, for indefiniteness. This rejection is respectfully 
traversed. As with claims 51, 52, 60, 66, and 1 12 discussed above, each of these claims 
also define a length for the recited polypeptide. The reasons outlined above in item III 
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that clarify the claiming strategy of claims 51, 52, 60, 66, and 112 and are equally 
applicable to claims 57, 58, 67, 68, 74-78, 80, 81-83, 87-89, 92-95, 1 16-122 and claims 
70 parts (c), (d), (e), 71 parts (a), (b), (c), (d), (e), and 72 parts (c), (d), (e), and the 
Examiner is respectfully requested to withdraw the rejection. 

IV. Claim 89 was also rejected under 35 U.S.C. §112, second paragraph, for 
indefiniteness. The Examiner states that claim 89 recites a polypeptide having 376 amino 
acids, but is dependant on claim 87 which recites that the polypeptide has up to 328 
amino acids. Claim 89 has been amended to depend from claim 88. As amended, claim 
89 is a proper narrowing dependant claim, and the Examiner is respectfully requested to 
withdraw the rejection. 

The Rejection of the Claims Under 35 U.S.C. §102 Should Be Withdrawn 

I. Claims 61, 71(f), 98, and 1 13 were rejected under 35 U.S.C. §102(a) as being anticipated 
by Hammerschmidt et al. (1997) Molecular Microbiology 25:1113-11 24. The rej ection is 
respectfully traversed. 

Claims 61, 71(f) and 113 recite a polypeptide having an amino acid sequence set forth in 
SEQ ID NO:3, "wherein said amino acid sequence comprises at least one to 57 amino acid 
substitutions." As illustrated in the Blast alignment provided by the Examiner in the March 30, 
2004 Office Action, the sequence disclosed by Hammerschmidt et al. comprise more than 57 
amino acid substitutions when compared to SEQ ID NO:3. Therefore, Hammerschmidt et al. 
does not anticipate claims 61, 71(f), 98, and 113 and the Examiner is respectfully requested to 
withdraw the rejection. 

II. Claims 54-58, 67-72, 116 and 117 were rejected under 35 U.S.C. § 102(e) as being 
anticipated by U.S. Patent No. 6,573,082. Specifically, the Examiner states that U.S. Patent 
6,573,082 discloses SEQ ID NO:38 which corresponds to SEQ ED NO:4 and SEQ ID NO:22 of 
the instant application. This rejection is respectfully traversed. 





Appl.No.: 09/056,019 
Filed: 4/7/98 

Page 20 

Claims 54-58, 67-72, 116 and 1 17 are drawn to various polypeptides having the amino 
acid sequence set forth in SEQ ID NO:4 or 22 wherein the "polypeptide does not bind choline." 
SEQ ID NO:38 disclosed in U.S. Patent 6,573,082 contains a choline-binding domain and 
therefore does not anticipate the claimed invention. As evidence for the presence of a choline- 
binding domain and choline-binding activity, Applicants provide in Appendix A Femandez- 
Tomero et al. (2001) Nature Structural Biology <5:1020-1024. Femandez-Tomero et al. studies 
the structure of choline-binding proteins and provides in Figure 2, page 1021 a description of the 
consensus sequence for a choline-binding domain. For the Examiner's convenience the Table 1 
is provided below which aligns the six (6) choline binding domains appearing in SEQ ED NO:38 
with the choline-binding domain consensus sequence determined by Femandez-Tomero et al. 


Table 1. Alignment of Choline Binding Domains of SEQ ID NO:38 from U.S. Patent No. 
6,573,082 with the Choline Binding Domain Consensus Sequence 


Choline binding domain 

position 

sequence 

CBD#1 

202-306 

G MWYFYNTDGSMATGWLQNNGS WYY 


CBC#2 

311-326 

Gama. tGwlqnngs wyy! 

CBD#3 

331-346 

GSMA TGWLQNNGS WYY ; 

CBD#4 

351-366 

GSMA TGWLQYNGS WYY : 

CBD#5 

371-386 

GSMA TGWLQYNGS WYY 

CBD#6 

391-406 

GDMA. TGWVKD . GDTWYY 

Global consensus sequence* 


Q.m.. |TG<^ ; !g. . WYY] 


* O symbolizes a hydrophobic residue 


The evidence presented above clearly establishes that the amino acid sequence set forth 
in SEQ ID NO:38 of U.S. Patent No. 6,573,082 contains a choline-binding domain and therefore 
is capable of binding choline. Claims 54-58, 67-72, 116 and 117 recite that the amino acid 
sequence set forth in SEQ ID NO:4 or 22 "does not bind choline," and accordingly, the claims 
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are not anticipated by SEQ ID NO:38 of U.S. Patent No. 6,573,082. The Examiner is 
respectfully requested to withdraw the rejection. 

III. Claim 71, part (a), (d), (e) and claims 88 and 93 were rejected under 35 U.S.C. §102(e) as 
being anticipated by U.S. Patent No. 6,500,613 to Briles et al. Specifically, the Examiner asserts 
U.S. Patent No. 6,500,613 discloses SEQ ID NO:40 and 42 which correspond to SEQ ID NOS:5, 
9, and 10 of the present invention. This rejection is respectfully traversed. 

Claim 71(d) and claim 88 recite an amino acid sequence set forth in SEQ ID NO:9, 
"wherein the amino acid sequence comprises at least one to 57 amino acid substitutions ... and 
comprises up to 376 amino acids." Claims 71(d) and 88 are not anticipated by SEQ ID NO:40 or 
42 of U.S. Patent No. 6,500,613. 

Claims 71(d) and 88 recite that the polypeptide comprises at least one to 57 amino acid 
substitutions. Page 13, lines 7-10 of the instant specification define substitutions as "one or more 
residue specified are replaced by other residues." When SEQ ID NOS:40 and 42 are aligned 
with SEQ ID NO:9, more than 57 amino acid substitutions occur. For the Examiner's 
convenience a Clustal W alignment of SEQ ID NO:9 with SEQ ID NO:40 is provided in 
Appendix B and an alignment of SEQ ID NO:9 with SEQ ID NO:42 is provided in Appendix C. 
Accordingly, claims 71(d) and 88 are not anticipated by U.S. Patent No. 6,500,613, and the 
Examiner is respectfully requested to withdraw the rejection. 

Claim 71(a) recites an amino acid sequence set forth in SEQ ID NO:5, "wherein the 
amino acid sequence comprises at least one to 57 amino acid substitutions... and comprises up to 
398 amino acids." Claim 71(a) is not anticipated by SEQ ID NO:40 or 42 of U.S. Patent No. 
6,500,613. 

Claim 71(a) recites that the polypeptide comprises at least one to 57 amino acid 
substitutions. Page 13, lines 7-10 of the instant specification define substitutions as "one or more 
residue specified are replaced by other residues". When SEQ ID NOS:40 and 42 are aligned 
with SEQ ID NO:5, more than 57 amino acid substitutions occur. For the Examiner's 
convenience a Clustal W alignment of SEQ ID NO:5 with SEQ ID NO:40 is provided in 
Appendix D, and an alignment of SEQ ED NO:5 with SEQ ID NO:42 is provided in Appendix E. 
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Accordingly, claim 71(a) is not anticipated by U.S. Patent No. 6,500,613, and the Examiner is 
respectfully requested to withdraw the rejection. 

Claims 71(e) and 93 recite an amino acid sequence set forth in SEQ ED NO: 10, "wherein 
the amino acid sequence comprises at least one to 57 amino acid substitutions. . .and comprises 
up to 328 amino acids." Claims 71(e) and 93 are not anticipated by SEQ ID NO:40 or 42 of U.S. 
Patent No. 6,500,613. 

SEQ ID NO:40 (total length 588 amino acids) of U.S. Patent 6,500,613 is a fragment of 
SEQ ED NO:42 (total length 864 amino acids). Both SEQ ID NO:40 and 42 differ from SEQ ID 
NO:10 by nine (9) amino acid substitutions. Claims 71(e) and 93 have been amended to recite 
an amino acid sequence set forth in SEQ ID NO: 10, wherein the amino acid sequence comprises 
at least one to 57 amino acid substitutions and comprises "up to 147 amino acids." Support for 
this amendment can be found, for example, in Figure 2A-2. U.S. Patent No. 6,500,613 does not 
disclose a fragment of SEQ ID NO:42 (encoding pspC) that is between 106 and 147 amino acids 
in length and having the amino acid sequence set forth in SEQ ED NO: 10. Accordingly, as 
amended, claims 71(e) and 93 are not anticipated by U.S. Patent No., 6,500,613 and the rejection 
should be withdrawn. 


CONCLUSIONS 

The Examiner is respectfully requested to withdraw the rejections and allow claims 46- 
58, 60, 61, 66-72, 74-78, m 80-83, 87-89, 92-95, 98, 1 12, 1 13, and 116-122. Early notice to this 
effect is solicited. 

It is not believed that extensions of time or fees for net addition of claims are required, 
beyond those that may otherwise be provided for in documents accompanying this paper. 
However, in the event that additional extensions of time are necessary to allow consideration of 
this paper, such extensions are hereby petitioned under 37 CFR § 1.136(a), and any fee required 
therefore (including fees for net addition of claims) is hereby authorized to be charged to Deposit 
Account No. 16-0605. 
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Choline binding proteins are virulence determinants present 
in several Gram-positive bacteria. Because anchorage of these 
proteins to the cell wall through their choline binding domain 
is essential for bacterial virulence, their release from the cell 
surface is considered a powerful target for a weapon against 
these pathogens. The first crystal structure of a choline bind- 
ing domain, from the toxin-releasing enzyme pneumococcal 
major autolysin (LytA), reveals a novel solenoid fold consist- 
ing exclusively of 0 -hairpins that stack to form a left-handed 
superhelix. This unique structure is maintained by choline 
molecules at the hydrophobic interface of consecutive hair- 
pins and may be present in other choline binding proteins that 
share high homology to the repeated motif of the domain. 

Streptococcus pneumoniae (pneumococcus), the leading bacte- 
rial cause of acute respiratory infections, is estimated to result in 
over 6 million deaths every year worldwide from pneumonia, 
meningitis or bacteremia, especially among children and the 
elderly 1 . Moreover, the increasing number of antibiotic -resistant 
strains 2 and the suboptimal clinical efficacy of the available vac- 


cines 3 hamper control of this pathogen. In view of this situation, 
substantial attention has focused on virulence- related pneumo- 
coccal proteins as potential targets for drug design because they 
are common to all serotypes. Among these proteins, the fifteen- 
member family 1 of choline binding proteins (ChBPs) appears as 
a viable target because they are involved in pathogenic processes, 
such as adhesion to host cells, nasopharyngeal colonization and 
bacterial sepsis 4 . Although ChBPs are responsible for a wide 
range of different functions, they all share a highly conserved 
choline binding domain (ChBD) through which they attach 
noncovalently to choline moieties of both teichoic and lipo- 
teichoic acids of the cell surface 5 . This manner of displaying pro- 
teins at the cell surface, described also for other Gram-positive 
bacteria and considered peculiar to them 6 , is essential for bacte- 
rial virulence 7 - 8 . 

The major pneumococcal autolysin (LytA), the first and one 
of the better characterized ChBPs, catalyzes the cleavage of the 
N-acetylmuramoyl-L-alanine bond of the pneumococcal pepti- 
doglycan backbone 9 . LytA is responsible for cellular autolysis, 
through which it mediates release of toxic substances — such as 
the pore-forming toxin pneumoiysin and ceil wall degradation 
products — that damage endothelial and epithelial barriers and 
allow pneumococci to gain access to the bloodstream and dis- 
seminate through the body 10 . The C -terminal moiety of LytA 
(C-LytA), consisting of residues Val 188-Lys 318 with six extra 
amino acids at the N -terminus added during the cloning proce- 
dure, was obtained by protein engineering and purification with 
choline chloride and has been shown to constitute the ChBD of 
LytA 11 in the fully active form of the enzyme 12 . The primary 
sequence of C-LytA is constituted by a tandem of imperfect 20 
residue repeats, known either as P-motifs 13 or cell wall binding 
(CWB) repeats 14 . The presence of this sequence repeat defines 
the family of cell wall binding 1 proteins (Pfam 14 ID code 
PFO 1473). Previous analyses of the primary sequence of the frag- 
ment suggest that it may contain either six P-motifs 13 or four 
CWB repeats 14 , depending on the nature of the basic 20- residue 
consensus pattern. 



Fig. 1 . The crystal structure of LytA ChBD. a. Stereo ribbon diagram of C 
LytA ChBD with hairpins assignment. Hairpins (’hp') are colored cyan, 
whereas the loops connecting them are colored yellow, b, Stereo Ca 
trace of the C-LytA structure from the front view — that is. from the N- 
terminal base of the cylinder. A Ca trace of a different color Is shown 
for each substructure: red. dark blue, green, orange, light blue and 
black for hairpins 1-6, respectively, c. Scheme for the first three steps 
in a complete turn of the spiral staircase, with the same colors as in (b). 

The white arrow in the middle indicates the downstairs (N- to C-termi- 
nus) direction. This figure was prepared using MOLSCRIPT 28 . 
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Fig. 2. Sequence and structural similarities among repeats. 
a, Sequence alignment of the seven ChBRs of LytA using the 
structure criterion and prepared with ALSCRIPT 29 . The repeat 
numbers and the corresponding ranges of amino acids are 
shown on the left. The portions of the sequence that form 
the first and second strands of the hairpins are marked with a 
barreled arrow below. Conserved residues among ChBRs 1-5 j 
are highlighted yellow (>50% conservation) or red (100% : 
conservation). Choline-binding residues are indicated with a 
black trlangie. A consensus for ChBRs 1-5 has been derived, 
with bold letters used for 100% conservatio n, capital letters 
for 80% and small letters for bu% .~ l heTonsensus derived 
from this alignment was used to search for further ChBRs in 
the primary structure of LytA. The search revealed that the 
N-terminus of C-LytA may contain a seventh motif (ChBRO). 
Italicized residues are not visible in the electron density maps 
(ChBRO is not even present in the purified protein). The gen- 
eral consensus derived from the >600 CWB repeats found in 
the Pfam web page 14 has also been included. <X> symbolizes a 
hydrophobic residue, b, Role of the conserved residues (a) in 
the structural motif. ChBR2 has been chosen as example. Only 
the Ca and side chains of conserved residues common to the 
two consensus sequences have been represented with ball- 
and-stick format, c. Stereo view of the superposition of the 
six ChBRs. The same color scheme as in Fig. 1b has been used. 


structural family of solenoids, which are structures that 
contain a superhelical arrangement of repeating structural 
units 15 . According to Kobe-Kajava classification 15 , this new 
fold could be designated as the left-handed pp-3-solenoid. 
As in other known pure P-solenoids 15 , no curvature and 
practically no twist are observed along the staircase axis. 
Despite these similarities, the ChBD of LytA differs from 
described p-solenoids because it is built from individual 
supersecondary bricks — the hairpins — that have their 
own entity. The currently described structure represents a 
novel protein fold, as revealed by DALI 16 . 


In order to show how ChBPs are anchored to the cell wall of 
Gram-positive bacteria and to assist in the design of new drugs 
against the infections of these pathogens, we have solved the struc- 
ture of the C-LytA-choline complex at 2.6 A resolution using the 
multiwavelength anomalous dispersion (MAD) method. 

Architecture of LytA ChBD 

The overall shape of the C-LytA monomer is approximately 
cylindrical (Fig. 1 a) , with a diameter of 25 A and height of -60 A. 
The secondary structure is comprised of six independent p-hair- 
pins, labeled by their position in the primary sequence and each 
. consisting of two antiparallel p-strands connected by a short 
internal loop region (Fig. la, cyan). Analysis of the secondary 
structure revealed that all the p-strands have the same length and 
character (five residues and predominantly hydrophobic). 
(Consecutive hairpins are connected by loops of 8-10 residues 
. (Fig, la, yellow) that contain a type I +G1 p-bulge turn, plus 4-6 
residues mostly in an extended conformation. The hairpins 
extend perpendicularly from the axis towards the surface of the 
Cylinder, as shown by a frontal view of the protein backbone from 
the N-terminal base of the cylinder (Fig. lb). With each succes- 
sive hairpin, a 120° counter-clockwise rotation is introduced so 
that the 1 and i+3 hairpins become superimposed, resulting in a 
left-handed superhelix. Thus, the backbone structure can be 
described as a spiral staircase with three steps per turn (Fig. lc). 
The pitch of the i+3 hairpins superhelix is -30 A; climbing down 
the staircase, each hairpin step would lower us -10 A. 

Based on the described topology, we propose that the ChBD of 
LytA belongs to the recently defined protein three-dimensional 


Relationship to sequence repeats 

The canonical repeat in C-LytA has been proposed to be -20 
residues long, but the repetitive nature of the motifs have made 
precisely fixing their exact limits by sequence analysis diffi- 
cult 131417 . Based on the structure presented here, each repeat is 
proposed to encompass two structural units: a P hairpin and its 
preceding 8- 10-residue connecting loop. Therefore, the six 
repeats of C-LytA can be redefined structurally as choline bind- 
ing repeats (ChBRs) 1-6 (Fig. 2a) . Identifying highly conserved 
residues based on the structural alignment of the first five 
repeats is possible, thereby Hpfinir^ a sequence pattern that 
agrees with the global consensus obtained from a multiple align- 
mertfot >600 CwtJ repeats 14 (Fig. 2b). In Uit r second strand of 
the hairpin, aiumauc residues are strictly conserved, whereas 
less conservation is seen in the N-terminal strand. The preceding 
loop would be expected to show reduced sequence identity. 
However, a Gly residue is always found at position 5 of the motif, 
not only because of topological requirements (a ft conforma- 
tion) of the type I turn but also due to steric hindrance with 
neighboring side chains. 

Although the ChBRs secondary structures are very similar 
(Fig. 2 c), deeper analysis of the structure shows that hairpin 6 
significantly deviates from the tandem repeats. The angle 
between hairpin 5 and 6 is only 95® (instead of 120° for all the 
other hairpins), and the later hairpin is not exactly perpendicu- 
lar to the cylinder axis. This produces both the imperfect overlap 
of hairpin 6 with hairpin 3 (Fig. lb) and a slight bend at the 
C-terminal end of the cylinder that distorts the general fold of 
the superhelix (Fig. la). These sineular structural features of 
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Fig. 3. Chotine binding sites, a, Ribbon diagram of the C-LytA dimer inscribed into the molecular surface. Monomers are highlighted In different col- 
ors: yellow and cyan. ChBSs 1 and 2 of monomer 'a' (yellow) are occupied by DDAO molecules. ChBS3 of monomer b* (cyan) is occupied by the 
(2,2':6\2"-terpyridine)-platinum(ll) used for MAD phasing. The hydrophobic components of choline (labeled 'cho'), DDAO (labeled *ddao') and ter- 
pyridin (labeled 'tpy') molecules, schematized as CPK, occupy small hydrophobic cavities on the surface of the protein, b, Stereo diagram of ChBS4, 
where choline is highlighted in orange. The side chains of the hydrophobic conserved residues forming the cavity are shown in the ball-and-stick for- 
mat. The 2F 0 - F e omit map (green) of the choline molecule is contoured at 1.0 a. 


ChBR6 correlate with its unique primary sequence characteris- 
tics revealed by alignment of the motifs. The hydrophobic 
residues in the second strand of its hairpin are less bulky, and 
this repeat contains a two-residue insertion between the con- 
served Gly residue of the loop and the beginning of the hairpin 
(Fig. 2a). 


Dimer conformation 

The asymmetric unit of C-LytA crystals contains two molecules 
arranged as a dimer throughout their C -terminal regions 
(Fig. 3a). The overall shape of the dimer is reminiscent of a 
boomerang with arm lengths of 50 A and an angle between the 
superhelical arms of -85°. The arms are related by a noncrystal- 
lographic two-fold axis along the bisector of the 85° angle 
defined by them. The boomerang is likely to carry the catalytic 
domains of LytA at the end of its arms. The interaction of the 
monomers involve the predominantly hydrophobic coupling of 
hairpins 6 with the almost perpendicular pairing of hairpins 5 of 
both monomers, burying 1,950 A 2 of surface area per monomer, 
almost a quarter of the accessible surface area of each monomer 
(8,600 A 2 ). The singular characteristics of the architecture of 
ChBR6 described in previous sections minimize steric repul- 
sions and introduce the bend, which is necessary to enhance 
dimerization. Ultracentrifugation has demonstrated that fully 
active LytA and C-LytA — that is, in the presence of choline 
chloride — form a dimer in solution, whereas LytA lacking its 16 
C-terminal residues forms a monomer in the same conditions 18 . 
Moreover, the decrease in the catalytic efficiency of LytA (>90%) 
in this monomeric, truncated form 18 further substantiates the 
physiological relevance of the C-terminal dimerization. 


Choline binding sites 

Choline molecules, which form the headgroups of teichoic and 
lipoteichoic acids in the cell surface, were clearly visible in the 
electron density maps (Fig. 3 b). Four choline binding sites 
(ChBSs) are found per monomer of C-LytA (Fig. 3a). Each site is 
formed by the interface between consecutive hairpin pairs: hair- 
pins 1 and 2 (ChBSl), hairpins 2 and 3 (ChBS2), hairpins 3 and 
4 (ChBS3), and hairpins 4 and 5 (ChBS4). The nature of the 
interaction Is mainly hydrophobic, with the three choline methyl 
groups filling a shallow cavity of -1 5 A 3 constituted by three aro- 
matic residues from the hairpins surrounding the site plus a 
hydrophobic residue (Met or Leu) from an 8-10 residue con- 
necting loop (Fig. 2a, black triangles). A cation-rc interaction 


between the electron-rich systems of the aromatic rings and the 
positive charge of choline enhances the binding 19 . Other struc- 
tures of proteins bound to choline or analogs have been deter- 
mined, including anti phosphoryl -choline 20 and acetylcholine 
binding proteins 21 . All share similar binding pockets to the 
hydrophobic head of the choline molecule. 

Although the basis of the cation-n interaction implies a com- 
mon aromatic binding pocket in these structures, choline in the 
ChBD of LytA appears as an essential requirement for the main- 
tenance of the architecture of the superhelix, shielding the non- 
buried hydrophobic interface between consecutive hairpins 
from the solvent and stacking the hairpins together. This idea is 
supported by experiments showing that the catalytic activity of 
soluble LytA is only possible upon choline binding 12 . Following 
the repetitive structure of C-LytA, an additional choline-bind- 
ing cavity could have been expected to exist between hairpins 5 
and 6, Nevertheless, the special architecture of this region (see 
above) alters the topology of the patch, which then becomes 
stabilized (especially the residues of hairpin 6) by interaction 
with the equivalent region of the other monomer upon dimer- 
ization. Thus, although choline seems to be required for dimer- 
ization 18 , our data suggest that it is not directly involved in the 
assembly of the dimer. 

A potential binding surface 

Like other biologically relevant helices 15 , the structure of the 
C-LytA monomer displays spiral grooves on its surface that are 
generated by the left-handed superhelical twist of the molecule 
(Fig. 4a). These grooves, which connect consecutive ChBSs, 
formed by polar and charged residues (positions 1, 3, 19 and 22 
of the alignment; Fig. 2a), are -10 A long, 7 A deep and have a 
maximal aperture of 15 A. The structure of lipoteichoic acid has 
been deduced by NMR spectroscopy over three different frag- 
ments of the acid and are shown to contain two to eight glycidic 
building blocks, each of which has two phosphocholine groups 5 . 
The distance between the hydrophobic heads of these phospho- 
cholines can be reduced to 10 A by simple torsions. Therefore, the 
groove between each pair of consecutive ChBSs is likely to 
accommodate a glycan unit of the pneumococcal telchoic/lipo- 
teichoic acid. Furthermore, N and O atoms from N-acetylated 
galactosamine residues of the glycan units may establish hydro- 
gen bonds and electrostatic interactions with the corresponding 
polar atoms of the defined residues on the groove surface 
(Fig. 4 b) . This could explain the high affinity of C-LytA for pneu- 
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Fig. 4. A potential binding surface and the relevance to other ChBPs. a. The groove between two consecutive ChBSs is colored green. Choline mole- 
cules are represented with rods. This groove is proposed to shelter the glycan component of teichoic and lipotelchoic adds (b). Figure prepared with 
GRASP 30 , b. Scheme of a possible way of anchoring the glycan component of teichoic/lipoteichoic acids in the groove between two consecutive 
ChBSs. Choline molecules in the currently described structure are schematized as green spheres, whereas a complete repetitive unit of 
teichoic/lipoteichoic acids is in ball-and-stick representation, with C atoms (gray), N atoms (blue) and 0 atoms (red). These N and O atoms may estab- 
lish hydrogen bonds with polar and charged residues (positions 1, 3, 19 and 22 of the alignment; Fig. 2a) at the surface of the groove, c, Eight illus- 
trative proteins containing ChBRs have been schematized. LytA is the one reported here, PspA has a longer ChBD, CbpD has a lower number of 
repeats. LytB carries the ChBD at the N-terminus. Cpl-1 is an enzyme of a bacteriophage, Gtf-I of 5. downei and ToxB of C. difficile have a high num- 
ber of repeats in various tandems, and DsrB of L. mesenteroides carries ChBRs at both sides of the catalytic domain. Although the total number of 
ChBRs (schematized as red-colored boxes) varies greatly among the members of the family (ranging from four to 18), the high sequence conserva- 
tion of hydrophobic residues present in these repeats (Fig. 2a) allows us to propose a common architecture for the ChBD. The catalytic domains with 
known function are green colored; the blue boxes represent putative functional domains. The last repeats in LytA and Cpl-1 are colored yellow to 
highlight their particular sequence and structural characteristics. 


mococcal cell walls, as the combination of single small binding 
sites have been shown to provide high affinity to interactions 22 . 

Relevance to other ChBPs 

The primary sequence of the repeating units (not considering 
ChBR6) is highly conserved within the large number of proteins 
from Gram-positive bacteria and their bacteriophages (50-100 
members, depending on the source) belonging to the cell wall 
binding 1 family 14 . The total number of repeats and their loca- 
tion in the primary sequence greatly varies among the proteins 
in which they are present (Fig. 4c). Genetic analyses 23 suggest 
that the ChBD results from a series of gene duplication events 
that copied the basic repeating unit, as has also been suggested 
for other solenoid proteins 15 . Because the sequence of the ChBRs 
is largely defined by the conservation of hydrophobic residues 
(Fig. 2a; alignment of the >600 identified CWB repeats at the 
Pfam Web page 14 ), repeats found in other ChBPs are likely to 
form a superhelix with the same characteristics as those 
described here for the ChBD of LytA (Fig. 4c). Proteins with a 
low number of ChBRs In tandem (less than three) are not 
expected to have affinity for cell walls, because we show that a 
single choline binding site requires residues from three consecu- 
tive repeats. Longer ChBDs are likely to display additional 
choline binding sites, which would give the protein a higher 
affinity for the pneumococcal cell wall. In two ChBPs (pneumo- 
coccal murein hydrolases LytB and LytC), the ChBD has been 


found at the N-terminus 14 . There is at least one ChBP (dextran- 
sucrase B of Leuconostoc mesenteroides) with ChBRs at both sides 
of the catalytic domain 14 . The solenoid structure described here 
can easily fit the topology of these three ChBPs, because the 
overall symmetrical character of the C-LytA monomers with 
respect to their mass center (ChBR6 not considered) should 
allow them to accommodate the catalytic domain at either end. 
The unique sequence and folding characteristics of ChBR6 sug- 
gest a divergent evolution. Because ChBR6 is not found in 
ChBPs other than LytA or those of several pneumococcal bacte- 
riophages (BLAST results not shown) , whether the remaining 
ChBPs dimerize is uncertain. 

Conclusions 

The first three-dimensional structure for a choline-dependent 
cell wall anchoring domain, which is present in a wide range of 
virulence-related proteins from Gram-positive bacteria, pro- 
vides unique Insights into the mechanism of attachment of 
ChBPs to bacterial cell surface. This structure constitutes a new 
protein fold, the left-handed Pp-3-solenoid spiral staircase, 
which consists exclusively of P-hairpins that stack to form a 
superhelix maintained by choline molecules at hydrophobic cav- 
ities on the protein surface. Furthermore, our structure suggests 
how teichoic and lipoteichoic acids present in the bacterial cell 
surface may specifically recognize and bind to the ChBD. 
Because the virulence of pneumococcus is significantly reduced 
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Table 1 Data collection, phasing and refinement statistics 


Data collection 

Peak 

Inflection 

Remote 

Wavelength (A) 

1.0695 

1.0715 

0.9840 

Resolution (A) 

35.0-2.6 

35.0-2.6 

35.0-2.6 

Measurements 

105.081 

96,176 

96,859 

Unique reflections 

11,378 

11,382 

11,367 

RpM 1 

4.9 (26.6) 

4.7 (24.8) 

5.2 (27.7) 

i/od) 1 

12.3(2.8) 

12.9 (3.0) 

8.1 (2.7) 

Completeness (%)’ 

99.7 (100) 

99.7 (100) 

99.6 (100) 

MAD phasing 




Rcutfii 




iso 

reference 

0.46 

0.44 

ano 

0.74 

0.88 

0.59 

Phasing power 




iso 

reference 

3.3 

3.3 

ano 

2.2 

1.6 

2.9 


Figure of merit 
Before solvent flattening 
After solvent flattening 
Anomalous scatterer 


0.41 

0.84 

Platinum (1 site) 


(DDAO). These crystals were soaked for 2 h in the same buffer solu- 
tion containing 4 mM (2.2 / :6'.2"-terpyridine)-platinum(ll) chloride. 

Data collection and structure determination. Diffraction data 
were collected at 100 K with a MAR345 detector at DESY-X31 beam- 
line, and processed with M0SFLM 2 « and the CCP4 suite 25 . The crys- 
tals belong to the 1222 space group (a = 58.0 A, b = 1 18.2 A and c « 
104.9 A), with two protein molecules per asymmetric unit and a 
56% solvent content (Tablel). 

The heavy atom search performed with CNS 26 found one plat- 
inum site in the asymmetric unit The Pt-MAD phasing and the sub- 
sequent solvent flattening at 2.6 A were performed with SHARP 27 , 
and the generated electron density map was used to build the first 
model. Several steps of simulated annealing and B-factor refine- 
ment against the peak-dataset using CNS 26 were carried out until 
the Rwortt and R*,* values dropped to 21.8% and 28.2%, respectively. 
The average temperature factor for the N-terminal part of the sec- 
ond monomer (Gly 192-Arg 219), which was modeled with the aid 
of noncrystallographic symmetry restriction and averaging, is sig- 
nificantly higher (85.21 A 2 ) than for the corresponding region of 
the other monomer (36.86 A 2 ). 

Coordinates. The coordinates have been deposited in the 
Brookhaven Protein Data Bank (accession number 1HCX). 


Model refinement 

Refinement range (A) 35-2.6 

Reflections 

Work 10,209 

Free 2 1,169 

Rwc*/Rfr«(%) 21.8/28.2 

R.m.s. deviation 

Bond lengths (A) 0.007 

Angles (°) 1,2 

Number of atoms 

Protein 2,122 

Solvent 132 

B-factor (A 2 ) 

Wilson 43.6 

Average 52.7 

Values in parentheses correspond to the highest resolution shell 
(2.74-2.60 A). 

deflexions in the test set represent a 10% of the total number of reflec- 
tions used during refinement. 

when ChBPs are released from the cell wall, the crystal structure 
of the ChBD from LytA bound to choline may represent a new 
lead for developing novel drugs against pneumococcal infec- 
tions. Compounds blocking the ChBSs should emerge as highly 
effective drugs because they would be aimed toward multiple 
targets (the entire set of ChBPs), which usually hinders the 
development of resistances. These drugs are likely to be success- 
ful against the Gram-positive bacteria, such as Clostridium diffi- 
cile and Streptococcus downed, which contain proteins with the 
ChBD. 

Methods 

Protein expression, purification and crystallization. The ChBD 
of the major lytic amidase of 5. pneumoniae was obtained using 
established protocols 11 . Crystals were grown using the sitting drop 
vapor diffusion method at 295 K over a well solution of 30% (w/v) 
PEG 4000 and 0.2 M ammonium acetate buffered with 0.1 M sodi- 
um citrate, pH 6.4, plus 0.4 mM N,N-dimethyl-decylamine-N-oxide 
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APPENDIX B 

SEQ ID N0:9 FROM U.S. App. No. 09/056,019 
with SEQ ID NO:40 from U.S. Patent No. 6,500,613 


CLUSTAX W (1.82) multiple sequence alignment 

SEQ : TO HO FASK5ERKVHYSIRKTSIGVA5VAVASLFI>GGVVHAEGVRSGN^TVTSSGQDISKKYAD 60 

S0SEQXDN09 


SE Q E^SHIiESILKDVKKNEKKVAEAQKKVEEAKKKAEDQKEKDRRNYPTITYKTLELEIAES 120 

S<JSEQrDN09 — 


SEQ D VE VKKAEIiE L VKVKAKESQD E EKI KQ AEAEVES KQ AEATRLKK I KTDRE EAKRKAD AKL 18 0 

SJSEQIDN09 

SEQ KEAVEKNVATSEQDKPKRRAKRGVSGEIATPDKKENDAKSSDSSVGEETXPSPSLNMAN- 239 

SJSEQIDN09 EMEG STQAATSSNMAKT 17 


SEQ 

SGSEQIDN09 


ESQTEHRKDVD E YIKKMLS EIQ LDRRKHTQNVNXNI KLSAI KTKYL YELS VXKENS fOCE E 299 

EHRKAAKQWDEYXEKMLRErQLDRRKHTQNVAIiMIKLSAXKTKYLRELNVIiEEKSK-DE 76 

* Z. Z Z ********* ************* *** 


t********** ** **.*.** - i 


SEQ 

S<JSEQIDN09 


XTSKTKAELTAAFEQFKICDTIjK^EKICVAEAEKKVEEAKKKAKDQKEEDRRNYPTOTYKTL 3 5 9 
LPSE I KAKXiDAAFEKFKKDTLKPGEKVAEAKKKVEEAKKKAEDQKEEDRRN YPTNTYfCTL 136 

*.*: **:* ****.******** j*****.**********.******.**********^ 


SEQ 

SJSEQIDM09 


ELEIAESDVKVKEAELELVKEEANESRNEEKIfCQAKEKVESKKAEATRLEKIKTDRKKAE 419 
ELEIAEFDVKVKEAELELVKEEAKESRKEGTIKQAKEKVESKKAEATRLENIKTDRKKAE 196 
****** **************** : ***** ,*******************.********* 


SEQ 

S0TSEQIXNO9 


EEAKRKAEESEKKAAEAKQKVDAEEYALEAKIAELEYEVQRLEKELKEIDESDSEDYLKE 479 

E EAKRKAD AKL KEANVATS 

*******. * - * * .... 


-DQGfCPKGRAKR 226 


SEQ 

SJSEQIDN09 


GLRAPIiQSKX.DTKKAKLSKXiEELSDKIDELDAEIAKLEVQLKDAEGMNNVEAYFKEGLiEK 5 3 9 

GVPG- ELATPDKKENDAKSS D S 247 

* : •* « : . .. ; ; * : * . : . -.**- 


SEQ 

SJSEQIDM09 


SEQ 

SJSEQIDN09 


TTAEKKAELEKAEADLKKAVDEPETPAPAPQPAPAPEKPAEKPAPAPEKPAPAPEKPAPA 599 

SVGEETL 

254 

- 

PEKPAPAPEKPAPAPEKPAPTPETPKTGWKQEMGMWYFYNTDGSMATGWLQKNGSWYYLN 65 9 


SEQ 

SJSEQIDN09 


SNGAMATGWLQNNGSWYYXJ^SMGAMATGWLQYMGSWYYLNANGDMATGWIiQYKGSfrTYYLN 


719 


?EQ ANGDMATGWFQYNGSWYYXNANGDMATGWFQYNGSWYYLNANGDMATGWLQYNGSWYYLN 779 

SJSEQIDN09 


SE Q SKGAK^TIXJWIiQUMGSWYYI^ANGSMATDWVT<IX3DTWYYXEASGAMKASQWFKVSDKWYYV 839 

. SJSEQIDW09 


SB Q NGSGALAVNTTVDS YRVNANG EWVN 864 

S<JSEQIDK09 



APPENDIX C 

SEQ ID N0:9 FROM U.S. App. No. 09/056,019 
with SEQ ID NO:42 from U.S. Patent No. 6,500,613 


CLUSTAL W (1.62) multiple sequence alignment 


BRILESSEQIDN04 2 EGVRSGNNLTVTSSGQDISKKYADEVESHLESILKDVKKNLKKVQHTQNVGLITKLSEIK 6 0 

SJSEQIDN09 


BRILESSEQIDN042 KKYLYDBKVNVLSEAELTSKTKETKEKLTATFEQFKKDTLPTEPEKKVAEAQKKVEEAKK 120 

SJSEQIDM09 i 


BRILESSEQIDN042 KAEDQKEKDRRNYPTITYKTLELEIAESDVEVKKAELELVKVKAKESQDEEKIKQAEAEV 180 

SJSEQIDN09 ■— 1 


BRILESSEQIDN042 ESKQAEATRLKKIKTDREEAKRKADAKLKEAVEKNVATSEQDKPKRRAKRGVSGELATPD 240 

SJSEQIDN09 


BRIEESSEQIDN042 KKENDAKSSDSSVGETLPSPSLNMAN-ESQTEHRKDVDEYIKKMLSEIQLDRRKHTQNVN 2 99 

SJSEQIDN09 . - - ENE GSTQAATSSNMAKTEHRKAAKQVVDEYIEKMLREIQLDRRKHTQNVA 5 0 

* „ * .2.* ***: * :: ********* ************* 


BRILESSEQIDN042 ^XKXSAIKTKYLYELSVLKENSKKEELTSKTKAELTAAFEQFKKDTLKPEKKVAEAEKK 3 59 

SJSEQIDN09 LNIKLSAI fCTKYLRELNVLEEKS K - DELPS E I KAKLDAAFEKFKKDTIiKPGEKVAEAKKK 109 

************* ********* ^ * ** * * **.* ************* ********* 


BRILESSEQIDN042 VEEAKKKAKDQKEEDRRNYPTMTYKTLELEIAESDVKVKEAELELVKEEAHESRNEEKIK 419 

SJSEQIDN09 VEEAKKKAEDQKEEDRRNYPTNTYKTLELEIAEFDVKVKEAELELVKEEAKESRNEGTIK 169 

********.************************ ********************** * * 


BRIEESSEQIDN042 

SJSEQIDK09 


QAKEKVESKKAEATREEKIKTDRKKAEEEAKRKAEESEKKAAEAKQKVDAEEYALEAKIA 479 

Q AKE KVES KKAEATRLEN I KTDRKfCAE E EAKRKADAKLKEANVATS 215 

*************** ** ***************** . __ * ; * : - . . 


BRIBES S EQ IDN04 2 
SJSEQIDN09 


ELEYEVQRLLKELKEIDESDSEDYLKEGLRAPLQSKLDTKKAKLSKIiEELSDKIDELDAE 539 


DQGKPKGRAKRGVPG E 231 

*. * : , * z * z * 2 • * 


BRILESSEQIDN042 

SJSEQIDN09 


IAKLEVQLKDAEGNNNVEAYFKEGLEKTTAEKKAELEKAEADLKKAVDE 588 
BATPD KKEND AKS S : DSSVGEETL 254 


. * 


- * * * 



APPENDIX D 

SEQ ID N0:5 FROM U.S. App. No. 09/056,019 
with SEQ ID NO:40 from U.S. Patent No. 6,500,613 


CLUSTAL W (1.82) multiple sequence alignment 


SEQ.1()l)C> WCP 
SJSEQIDNOS 


FASKSERKV7TYSIRKFSIGVASVAVASLFLGGVVHAEGVRSGNNLTVTSSGQDISKKYAD 

TEPGE 

. * * . 


SEQ 

SJSEQIDNOS 


SEQ 

SJSEQIDNOS 


EVESHLESILKDVKKNEKK^AEAQKKVEEAKKKAEDQKEKDRRNYPTITYKTLELEIAES 
KVAEAKKKVEEAEKKAKDQKEEDRRNYPTITYKTLELEIAES 
****** ****** ; *** ; **** . ******** ************ 

D VE VKKAE LE L VKVKAKESQD E E KI KQ AE AE VES KQ AEATRL KKI KTDREEAKRKAD AKL 
DVEVKKAELELVTCVKANEPRDEQKI KQAEAEVES KQAEATRLKKIKTDREEAEEEA- - — 

****************.*^ r**;*****************************. 


SEQ 

SJSEQIDNOS 

SEQ 

SJSEQIDNOS 


KEAVEKNVATSEQDKPKRRAKRGVSGELATPDKKENDAKSSDSSVGEETLPSPSLNMANE 

fCRRADA 


* * * * 



SEQ 

SJSEQIDNOS 


TS KTKAELTAAF EQ FKKDTLKPEKKVAE AEKKVEEAKKKAKDQ KEEDRRNYPTNT YKTLE 


SEQ 

SJSEQIDNOS 


LEIAESDVKVKEAELELVKEEANESRNEEKIKQAKEKVESKKAEATRLEKIKTDRKKAEE 


SEQ 

SJSEQIDNOS 


EAKRKAEESEKKAAEAKQKVDAEEYALEAKIAELEYEVQRLEKELKEIDESDSEDYLKEG 


SEQ 

SJSEQIDNOS 


LRAP LQS KLDTKKAKLS KLEELS D KI D ELD AE I AKLEVQ LKD AEGNNNVEAYF KEG LE KT 


SEQ 

SJSEQIDNOS 


TAE KKAE LE KAE ADLfOCAVD E PETP AP APQ P AP APEKPAE KP APAPE KP APAP E KP AP AP 


SEQ 

SJSEQIDNOS 


EKPAPAPEKPAPAPEKPAPTPETPKTGWKQENGMWYFYNTDGSMATGWLQNNG^fYYiNS 


SEQ 

SJSEQIDNOS 


NGAMATGWLQNNGS WYYLNSNGAMATXJWLQ YNGS^QT^NANGDMATGfTLQ YNGS^f^tLNA 



SEQ 

SJSEQIDN05 

SEQ 


NGANVTGVN^NNGSfiY X|LNANGSMATDWVKDGD^nn^iEASGAMKASQWFKVS D^0^VN 
GSGAIAVNTTVDS YRVNANGEW VN 8 €4 


60 

S 

120 

47 

180 

103 

240 

109 

30 ( 5 “ 


360 


420 


480 


540 


600 


660 


720 


780 


840 



'A 


APPENDIX E 

SEQ ID N0:5 FROM U.S. App. No. 09/056,019 
with SEQ ID NO:42 from U.S. Patent No. 6,500,613 


CLUSTAL W (1.82) multiple sequence alignment 


BRILESSEQIDN042 EGVRSGNNLTVTSSGQDISKKYADEVESHLESILKDVKKNLKKVQHTQNVGLITKDSEIK 60 

SJSEQIDNOS 


« BRILESSEQIDN042 KKYIiYDLKVNVLSEAELTSKTKETKEKLTATFEQFKKDTLPTEPEKKVAEAQKKVEEAKK 120 

SJSEQIDNOS TEPGEKVAEAKKKVEEAEK 19 

*** -ft****.****#*.* 


BRILESSEQIDN042 KAEDQKEKDRRNY PT IT YKTLELE I AES DVE VKKAELEL VKVKAKESQDEEKI KQ AEAEV 18 0 

SJSEQIDN05 KAKDQKEEDRRNYPTXTYKTLELEIAESDVEVKKAEIiELVKVKANEPRDEQKIKQAEAEV 79 

t*.****.************************************.*^ .**,********* 


BRILESSEQIDN042 ESKQAEATRL FCKI KTDREEAKRKADAKL KEAVEKNVATSEQDKPKRRAKRG VSG ELAT PD 240 

SJSEQIDN05 ESKQAEATRLKKIKTDREEAEEEA -KRRADA 109 

A*******************..* •♦*** 


BRILESSEQIDN042 K^ENDAKSSDSSVGETLPSPSLNMANESQTEHRKDVDEYIKKMLSEIQLDRRKHTQNVNL 300 

SJSEQIDN05 


BRILESSEQIDN042 MIKLSAIKTKYLYELSVLKENSKKEELTSKTKAELTAAFEQFKKDTLKPEKKVAEAEKKV 360 

SJSEQIDNOS 


BRILESSEQIDN042 EEAKKKAKDQKEEDRRNYPTNTYKTLELEIAESDVKVTOEAELELVKEEANESRNEEKIKO 420 

SJSEQIDNOS 


BRILESSEQIDN042 AKEKVESKKAEATRLEKIKTDRKKAEEEAKRKAEESEKKAAEAKQKVDAEEYALEAKIAE 480 

SJSEQIDNOS ______ 


BRILESSEQIDN042 LEYEVQRLLKELKEIDESDSEDYLKEGLRAPLQSKLDTKKAKLSKIjEELSDKIDELDAEI 540 

SJSEQIDNOS 


BRILESSEQ IDN04 2 AKLEVQLKDAEGNNNVEAYFKEGLEfCTTAEKKAELEKAEADLKKAVDE 588 

SJSEQIDNOS 



